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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



{ ix ) FEATukE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 166.. 1755 

(D) OTHER INFORMATION: /product= "ALPHA- 2 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



CAATGACCTG 


TTTTCTTCTG 


TAACCACAGG 


TTCGGTGGTG 


AGAGGAASCY 


TCGCAGAATC 


60 


CAGCAGAATC 


CTCACAGAAT 


CCAGCAGCAG 


CTCTGCTGGG 


GACATGGTCC 


ATGGTGCAAC 


120 


CCACAGCAAA 


GCCCTGACCT 


GACCTCCTGA 


TGCTCAGGAG 


AAGCCATGGG 


CCCCTCCTGT 


180 


CCTGTGTTCC 


TGTCCTTCAC 


AAAGCTCAGC 


CTGTGGTGGC 


TCCTTCTGAC 


CCCAGCAGGT 


240 


GGAGAGGAAG 


CTAAGCGCCC 


ACCTCCCAGG 


GCTCCTGGAG 


ACCCACTCTC 


CTCTCCCAGT 


300 


CCCACGGCAT 


TGCCGCAGGG 


AGGCTCGCAT 


ACCGAGACTG 


AGGACCGGCT 


CTTCAAACAC 


360 


CTCTTCCGGG 


GCTACAACCG 


CTGGGCGCGC 


CCGGTGCCCA 


ACACTTCAGA 


CGTGGTGATT 


420 


GTGCGCTTTG 


GACTGTCCAT 


CGCTCAGCTC 


ATCGATGTGG 


ATGAGAAGAA 


CCAAATGATG 


480 


ACCACCAACG 


TCTGGCTAAA 


ACAGGAGTGG 


AGCGACTACA 


AACTGCGCTG 


GAACCCCGCT 


540 


GATTTTGGCA 


ACATCACATC 


TCTCAGGGTC 


CCTTCTGAGA 


TGATCTGGAT 


CCCCGACATT 


600 


GTTCTCTACA 


ACAATGCAGA 


TGGGGAGTTT 


GCAGTGACCC 


ACATGACCAA 


GGCCCACCTC 


660 


TTCTCCACGG 


GCACTGTGCA 


CTGGCTGCCC 


CCOGCCATCT 


ACAAGAGCTC 


CTGCAGCATC 


720 


GACGTCACCT 


TCTTCCCCTT 


CGACC^GCAG 


AACTGCAAGA 


TGAAGTTTGG 


CTCCTGGACT 


780 


TATGACAAGG 


CCAAGATCGA 


CCTGGAGCAG 


ATGGAGCAGA 


CTGTGGACCT 


GAAGGACTAC 


840 


TGGGAGAGCG 


GCGAGTGGGC 


CATCGTCAAT 


GCCACGGGCA 


CCTACAACAG 


CAAGAAGTAC 


900 


GACTGCTGCG 


CCGAGATCTA 


CCCCGACGTC 


ACCTACGCCT 


TCGTCATCCG 


GCGGCTGCCG 


960 
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CTCTTCTACA 


CCATCAACCT 


CATCATCCCC 


TGCCTGCTCA 


TCTCCTGCCT 


CACTGTGCTG 


1020 


GTCTTCTACC 


TGCCCTCCGA 


CTGCGGCGAG 


AAGATCACGC 


TGTGCATTTC 


GGTGCTGCTG 


1080 


TCACTCACCG 


TCTTCCTGCT 


GCTCATCACT 


GAGATCATCC 


CGTCCACCTC 


GCTGGTCATC 


1140 


CCGCTCATCG 


GCGAGTACCT 


GCTGTTCACC 


ATGATCTTCG 


TCACCCTGTC 


CATCGTCATC 


1200 


ACCGTCTTCG 


TGCTCAATGT 


GCACCACCGC 


TCCCCCAGCA 


CCCACACCAT 


GCCCCACTGG 


1260 


GTGCGGGGGG 


CCCTTCTGGG 


CTGTGTGCCC 


CGGTGGCTTC 


TGATGAACCG 


GCCCCCACCA 


1320 


CCCGTGGAGC 


TCTGCCACCC 


CCTACGCCTG 


AAGCTCAGCC 


CCTCTTATCA 


CTGGCTGGAG 


1380 


AGCAACGTGG 


ATGCCGAGGA 


GAGGGAGGTG 


GTGGTGGAGG 


AGGAGGACAG 


ATGGGCATGT 


1440 


GCAGGTCATG 


TGGCCCCCTC 


TGTGGGCACC 


CTCTGCAGCC 


ACGGCCACCT 


GCACTCTGGG 


1500 


GCCTCAGGTC 


CCAAGGCTGA 


GGCTCTGCTG 


CAGGAGGGTG 


AGCTGCTGCT 


ATCACCCCAC 


1560 


ATGCAGAAGG 


CACTGGAAGG 


TGTGCACTAC 


ATTGCCGACC 


ACCTGCGGTC 


TGAGGATGCT 


1620 


GACTCTTCGG 


TGAAGGAGGA 


CTGGAAGTAT 


GTTGCCATGG 


TCATCGACAG 


GATCTTCCTC 


1680 


TGGCTGTTTA 


TCATCGTCTG 


CTTCCTGGGG 


ACCATCGGCC 


TCTTTCTGCC 


TCCGTTCCTA 


1740 


GCTGGAATGA 


TCTGACTGCA 


CCTCCCTCGA 


GCTGGCTCCC 


AGGGCAAAGG 


GGAGGGTTCT 


1800 


TGGATGTGGA 


AGGGCTTTGA 


ACAATGTTTA 


GATTTGGAGA 


TGAGCCCAAA 


GTGCCAGGGA 


1860 


GAACAGCCAG 


GTGAGGTGGG 


AGGTTGGAGA 


GCCAGGTGAG 


GTCTCTCTAA 


GTCAGGCTGG 


1920 


GGTTGAAGTT 


TGGAGTCTGT 


CCGAGTTTGC 


AGGGTGCTGA 


GCTGTATGGT 


CCAGCAGGGG 


1980 


AGTAATAAGG 


GCTCTTCCGG 


AAGGGGAGGA 


AGC GGGAGGC 


AGGGCCTGCA 


CCTGATGTGG 


2040 


AGGTACAGGG. 


CAGATCTTCC 


CTACCGGGGA 


GGGATGGATG 


GTTGGATACA 


GGTGGCTGGG 


2100 


CTATTCCATC 


CATCTGGAAG 


CACATTTGAG 


CCTCCAGGCT 


TCTCCTTGAC 


GTCATTCCTC 


2160 


TCCTTCCTTG 


CTCCAAAATG 


GCTCTGCACC 


AGCCGGCCCC 


CAGGAGGTCT 


GGCAGAGCTG 


2220 


AGAGCCATGG 


CCTGCAGGGG 


CTCCATATGT 


CCCTACGCGT 


GCAGCAGGCA 


AACAAGA 


2277 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Gly Pro Ser Cys Pro Val Phe Leu Ser Phe Thr Lys Leu Ser Leu 
1.5 10 15 

Trp Trp Leu Leu Leu Thr Pro Ala Gly Gly Glu Glu Ala Lys Arg Pro 
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20 25 30 

Pro Pro Arg Ala Pro Gly Asp Pro Leu Ser Ser Pro Ser Pro Thr Ala 
35 40 45 

Leu Pro Gin Gly Gly Ser His Thr Glu Thr Glu Asp Arg Leu Phe Lys 
50 55 60 

His Leu Phe Arg Gly Tyr Asn Arg Trp Ala Arg Pro Val Pro Asn Thr 
65 70 75 80 

Ser Asp Val Val lie Val Arg Phe Gly Leu Ser He Ala Gin Leu He 
85 90 95 

Asp Val Asp Glu Lys Asn Gin Met Met Thr Thr Asn Val Trp Leu Lys 
100 105 110 

Gin Glu Trp Ser Asp Tyr Lys Leu Arg Trp Asn Pro Ala Asp Phe Gly 
115 120 125 

Asn He Thr Ser Leu Arg Val Pro Ser Glu Met He Trp He Pro Asp 
130 135 140 

He Val Leu Tyr Asn Asn Ala Asp Gly Glu Phe Ala Val Thr His Met 
145 150 155 160 

Thr Lys Ala His Leu Phe Ser Thr Gly Thr Val His Trp Val Pro Pro 
165 170 175 

Ala lie Tyr Lys Ser Ser Cys Ser lie Asp Val Thr Phe Phe Pro Phe 
180 185 190 

Asp Gin Gin Asn Cys Lys Met Lys Phe Gly Ser Trp Thr Tyr Asp Lys 
195 200 205 

Ala Lys lie Asp Leu Glu Gin Met Glu Gin Thr Val Asp Leu Lys Asp 
210 215 220 

Tyr Trp Glu Ser Gly Glu Trp Ala lie Val Asn Ala Thr Gly Thr Tyr 
225 230 235 240 

Asn Ser Lys Lys Tyr Asp Cys Cys Ala Glu lie Tyr Pro Asp Val Thr 
245 250 255 

Tyr Ala Phe Val lie Arg Arg Leu Pro Leu Phe Tyr Thr lie Asn Leu 
260 265 270 

lie lie Pro Cys Leu Leu lie Ser Cys Leu Thr Val Leu Val Phe Tyr 
275 280 285 

Leu Pro Ser Asp Cys Gly Glu Lys lie Thr Leu Cys lie Ser Val Leu 
290 295 300 

Leu Ser Leu Thr Val Phe Leu Leu Leu lie Thr Glu lie lie Pro Ser 
305 310 315 320 

Thr Ser Leu Val He Pro Leu lie Gly Glu Tyr Leu Leu Phe Thr Met 
325 330 335 

lie Phe Val Thr Leu Ser lie Val lie Thr Val Phe Val Leu Asn Val 
340 345 350 
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His His Arg Ser Pro Ser Thr His Thr Met Pro His Trp Val Arg Gly 
355 360 365 

Ala Leu Leu Gly Cys Val Pro Arg Trp Leu Leu Met Asn Arg Pro Pro 
370 375 380 

Pro Pro Val Glu Leu Cys His Pro Leu Arg Leu Lys Leu Ser Pro Ser 
385 390 395 400 

Tyr His Trp Leu Glu Ser Asn Val Asp Ala Glu Glu Arg Glu Val Val 
405 410 415 

Val Glu Glu Glu Asp Arg Trp Ala Cys Ala Gly His Val Ala Pro Ser 
420 425 430 

Val Gly Thr Leu Cys Ser His Gly His Leu His Ser Gly Ala Ser Gly 
435 440 445 

Pro Lys Ala Glu Ala Leu Leu Gin Glu Gly Glu Leu Leu Leu Ser Pro 
450 455 460 

His Met Gin Lys Ala Leu Glu Gly Val His Tyr lie Ala Asp His Leu 
465 470 475 480 

Arg Ser Glu Asp Ala Asp Ser Ser Val Lys Glu Asp Trp Lys Tyr Val 
485 490 495 

Ala Met Val lie Asp Arg lie Phe Leu Trp Leu Phe lie lie Val Cys 
500 505 510 

Phe Leu Gly Thr lie Gly Leu Phe Leu Pro Pro Phe Leu Ala Gly Met 
515 520 525 

He 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 39.. 1553 

(D) OTHER INFORMATION: /product= "ALPHA- 3 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CCGACCGTCC GGGTCCGCGG CCAGCCCGGC CACCAGCCAT GGGCTCTGGC CCGCTCTCGC 60 

TGCCCCTGGC GCTGTCGCCG CCGCGGCTGC TGCTGCTGCT GCTGTCTCTG CTGCCAGTGG 120 

CCAGGGCCTC AGAGGCTGAG CACCGTCTAT TTGAGCGGCT GTTTGAAGAT TACAATGAGA 180 
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TCATCCGGCC 


TGTAGCCAAC 


GTGTCTGACC 


CAGTCATCAT 


CCATTTCGAG 


GTGTCCATGT 


240 


CTCAGCTGGT 


GAAGGTGGAT 


GAAGTAAACC 


AGATCATGGA 


GACCAACCTG 


TGGCTCAAGC 


300 


AAATCTGGAA 


TGACTACAAG 


CTGAAGTGGA 


ACCCCTCTGA 


CTATGGTGGG 


GCAGAGTTCA 


360 


TGCGTGTCCC 


TGCACAGAAG 


ATCTGGAAGC 


CAGACATTGT 


GCTGTATAAC 


AATGCTGTTG 


420 


GGGATTTCCA 


GGTGGACGAC 


AAGACCAAAG 


CCTTACTCAA 


GTACACTGGG 


GAGGTGACTT 


480 


GGATACCTCC 


GGCCATCTTT 


AAGAGCTCCT 


GTAAAATCGA 


CGTGACCTAC 


TTC C C GTTTG 


540 


ATTACCAAAA 


CTGTACCATG 


AAGTTCGGTT 


CCTGGTCCTA 


CGATAAGGCG 


AAAATCGATC 


600 


TGGTCCTGAT 


CGGCTCTTCC 


ATGAACCTCA 


AGGACTATTG 


GGAGAGCGGC 


GAGTGGGCCA 


660 


TCATCAAAGC 


CCCAGGCTAC 


AAACACGACA 


TCAAGTACAG 


CTGCTGCGAG 


GAGATCTACC 


720 


CCGACATCAC 


ATACTCGCTG 


WWCATCCGGC 


GGCTGTCGTT 


GTTCTACACC 


ATCAWCCTCA 


780 


TCATCCGCTG 

X ** X X*- V* \J w X \J 


GCTGATCATC 


TCCTTCATCA 


CTGTGGTCGT 


CTCCTACCTG 


CCCTCCGACT 


840 


GC GGCGAGAA 


GGTGACCCTG 


TGYATTTCTG 


TCCTCCTCTC 


CCTGACGGTG 


TTTC TC C TGG 


900 


TGATCACTGA 


GACCATCCCT 


TCCACCTCGC 


TGGTCATCCC 


CCTGATTGGA 


GAGTACCTCC 


960 


TGWWCACCAT 


GATTTGTGTA 

VJ** XXX \J X VJ X ** 


ACCTTGTCCA 


TCGACATCAC 


CGTCTGCGTG 


CTCAACGTGC 


1020 


ACTACAGAAC 


CCCGACGACA 


CACACAATGC 

^^**^^ ***_***** X WW 


CCTCATGGGT 


GAAGACTGTA 


TTCTTGAMCC 


1080 


TGCTCCCCAG 

X X \«* \«* \»***\J 


GGTCATGTWC 


ATGACCAGGC 


CAACAAGCAA 


CGAGGGCAAC 


GCTCAGAAGC 


1140 


CGAGGCCCCT 


CTACGGTGCC 


GAGCTCTCAA 


ATCTGAATTG 


CTTCAGCCGC 


GCAGAGTCCA 


1200 


AAGGCTGCAA 


GGAGGGCTAC 


CCCTGCCAGG 


ACGGGATGTG 


TGGTTACTGC 


CACCACCGCA 


1260 


GGATAAAAAT 

\J\Jm \ X ********** X 


CTCCAATTTC 


AGTGCTAACC 


TCACGAGAAG 


CTCTAGTTCT 


GAATCTGTTG 


1320 


ATGCTGTGCT 


GTCCCTCTCT 


GCTTTGTCAC 


CAGAAATCAA 


AGAAGCCATC 


CAAAGTGTCA 


1380 


AGTATATTGC 


TGAAAATATG 


AAAGCACAAA 


ATGAAGCCAA 


AGAGATTCAA 


GATGATTGGA 


1440 


AGTATGTTGC 


CATGGTGATT 


GATCGTATTT 


TTCTGTGGGT 


TTTCACCCTG 


GTGTGCATTC 


1500 


TAGGGACAGC 


AGGATTGTTT 


CTGCAACCCC 


TGATGGCCAG 


GGAAGATGCA 


TAAGCACTAA 


1560 


GCTGTGTGCC • 


TGCCTGGGAG 


ACTTCCTTGT 


GTCAGGGCAG 


GAGGAGGCTG 


CTTCCTAGTA 


1620 


AGAACGTACT 


TTCTGTTATC 


AAGCTACCAG 


CTTT 






1654 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
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Met Gly Ser Gly Pro Leu Ser Leu Pro Leu Ala Leu Ser Pro Pro Arg 
15 10 15 

Leu Leu Leu Leu Leu Leu Ser Leu Leu Pro Val Ala Arg Ala Ser Glu 
20 25 30 

Ala Glu His Arg Leu Phe Glu Arg Leu Phe Glu Asp Tyr Asn Glu lie 
35 40 45 

lie Arg Pro Val Ala Asn Val Ser Asp Pro Val lie lie His Phe Glu 
50 55 60 

Val Ser Met Ser Gin Leu Val Lys Val Asp Glu Val Asn Gin He Met 
65 70 75 80 

Glu Thr Asn Leu Trp Leu Lys Gin He Trp Asn Asp Tyr Lys Leu Lys 
85 90 95 

Trp Asn Pro Ser Asp Tyr Gly Gly Ala Glu Phe Met Arg Val Pro Ala 
100 105 110 

Gin Lys He Trp Lys Pro Asp He Val Leu Tyr Asn Asn Ala Val Gly 
115 120 125 

Asp Phe Gin Val Asp Asp Lys Thr Lys Ala Leu Leu Lys Tyr Thr Gly 
130 135 140 

Glu Val Thr Trp He Pro Pro Ala He Phe Lys Ser Ser Cys Lys He 
145 150 155 160 

Asp Val Thr Tyr Phe Pro Phe Asp Tyr Gin Asn Cys Thr Met Lys Phe 
165 170 175 

Gly Ser Trp Ser Tyr Asp Lys Ala Lys lie Asp Leu Val Leu He Gly 
180 185 190 

Ser Ser Met Asn Leu Lys Asp Tyr Trp Glu Ser Gly Glu Trp Ala He 
195 200 205 

He Lys Ala Pro Gly Tyr Lys His Asp He Lys Tyr Ser Cys Cys Glu 
210 215 220 

Glu lie Tyr Pro Asp lie Thr Tyr Ser Leu Xaa lie Arg Arg Leu Ser 
225 230 235 240 

Leu Phe Tyr Thr lie Xaa Leu lie lie Arg Trp Leu lie lie Ser Phe 
245 250 255 

lie Thr Val Val Val Ser Tyr Leu Pro Ser Asp Cys Gly Glu Lys Val 
260 265 270 

Thr Leu Cys lie Ser Val Leu Leu Ser Leu Thr Val Phe Leu Leu Val 
275 280 285 

lie Thr Glu Thr lie Pro Ser Thr Ser Leu Val lie Pro Leu lie Gly 
290 295 300 

Glu Tyr Leu Leu Xaa Thr Met lie Cys Val Thr Leu Ser lie Asp lie 
305 310 315 320 

Thr Val Cys Val Leu Asn Val His Tyr Arg Thr Pro Thr Thr His Thr 



7 



SD9951 



325 330 335 

Met Pro Ser Trp Val Lys Thr Val Phe Leu Xaa Leu Leu Pro Arg Val 
340 345 350 

Met Xaa Met Thr Arg Pro Thr Ser Asn Glu Gly Asn Ala Gin Lys Pro 
355 360 365 

Arg Pro Leu Tyr Gly Ala Glu Leu Ser Asn Leu Asn Cys Phe Ser Arg 
370 375 380 

Ala Glu Ser Lys Gly Cys Lys Glu Gly Tyr Pro Cys Gin Asp Gly Met 
385 390 395 400 

Cys Gly Tyr Cys His His Arg Arg lie Lys lie Ser Asn Phe Ser Ala 
405 410 415 

Asn Leu Thr Arg Ser Ser Ser Ser Glu Ser Val Asp Ala Val Leu Ser 
420 425 430 

Leu Ser Ala Leu Ser Pro Glu He Lys Glu Ala He Gin Ser Val Lys 
435 440 445 

Tyr He Ala Glu Asn Met Lys Ala Gin Asn Glu Ala Lys Glu He Gin 
450 455 460 

Asp Asp Trp Lys Tyr Val Ala Met Val He Asp Arg He- Phe Leu Trp 
465 470 , 475 *80 

Val Phe Thr Leu Val Cys He Leu Gly Thr Ala Gly Leu Phe Le* Gin 
485 490 495 

Pro Leu Met Ala Arg Glu Asp Ala 
500 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2363 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 173.. 2056 

(D) OTHER INFORMATION: /product= "ALPHA- 4 SUBUNIT" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCGCTCGCTG CGGCGCCGCC GCCGCNCCGC GCGCCACAGG AGAAGGCGAN CCGGGCCCGG 60 

CGGCCGAAGC GGCCCGCGAG GCGCGGGAGG CATGAAGTTG GGCGCGCACG GGCCTCGAAG 120 

CGGCGGGGAG CCGGGAGCCG CCCGCATCTA GAGCCCGCGA GGTGCGTGCG CCATGGAGCT 180 

AGGGGGCCCC GGAGCGCCGC GGCTGCTGCC GCCGCTGCTG CTGCTTCTGG GGACCGGCCT 240 
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CCTGCGCGCC 


AGCAGCCATG 


TGGAGACCCG 


GGCCCACGCC 


GAGGAGCGGC 


TCCTGAAGAA 


300 


ACTCTTCTCC 


GGTTACAACA 


AGTGGTCCCG 


ACCCGTGGCC 


AACATCTCGG 


ACGTGGTCCT 


360 


CGTCCGCTTC 


GGCCTGTCCA 


TCGCTCAGCT 


CATTGACGTG 


GATGAGAAGA 


ACCAGATGAT 


420 


GACCACGAAC 


GTCTGGGTGA 


AGCAGGAGTG 


GCACGACTAC 


AAGCTGCGCT 


GGGACCCAGC 


480 


TGACTATGAG 


AATGTCACCT 


CCATCCGCAT 


CCCCTCCGAG 


CTCATCTGGC 


GGCCGGACAT 


540 


CGCCCTCTAC 


AACAATGCTG 


ACGGGGACTT 


CGCGGCCACC 


CACCTGACCA 


AGGCCCACCT 


600 


GTTCCATGAC 


GGGCGGGTGC 


AGCGGACTCC 


CCCGGCCATT 


TACAAGAGCT 


CCTGCAGCAT 


660 


CGACGTCACC 


TTCTTCCCCT 


TCGACCAGCA 


GAACTGCACC 


ATGAAATTCG 


GCTCCTGGAC 


720 


CTACGACAAG 


GCCAAGATCG 


ACCTGGTGAA 


CATGCACAGC 


CGCGTGGACC 


AGCTGGACTT 


780 


CTGGGAGAGT 


GGCGAGTGGC 


TCATCTCGGA 


CGCCGTGGGC 


ACCTACAACA 


CCAGGAAGTA 


840 


CGAGTGCTGC 


GCCGAGATCT 


ACCCGGACAT 


CACCTATGCC 


TACGCCATCC 


GGCGGCTGCC 


900 


GCTCTTCTAC 


. ACCATCAACC 


TCATCATCCC 


CTGGCTGCTC 


ATCTCCTGCC 


TCACCGCGCT 


960 


GGTCTTCTAC 


CTGCCCTCCG 


AGTGTGGCGA 


GAAGATCACG 


CTGTGCATCT 


CCGTGCTGCT 


1020 


GTCGCTCACC 


GTCTTCCTGC 


TGCTCATCAC 


CGAGATCATC 


CCGTCCACCT 


CACTGGTCAT 


1080 


CCCACTCATC 


GGCGAGTACC 


TGCTGTTCAC 


CATGATCTTC 


GTCACCCTGT 


CCATCGCCAT 


1140 


CACGGTCTTC 


GTGCTCAACG 


TGCACCACCG 


CTCGCCACGC 


ACGCACACCA 


TGCCCACCTG 


1200 


GGTACGCAGG 


GTCTTCCTGG 


ACATCGTGCC 


ACGCCTGCTC 


CTCATGAAGC 


GGCCGTCCGT 


1260 


GGTCAAGGAC 


AATTGCCGGC 


GGCTCATCGA 


GTCCATGCAT 


AAGATGGCCA 


GTGCCCCGCG 


1320 


CTTCTGGCCC 


GAGCCAGAAG 


GGGAGCCCCC 


TGCCACGAGC 


GGCACCCAGA 


GCCTGCACCC 


1380 


TCCCTCACCG 


TCCTTCTGCG 


TCCCCCTGGA 


TGTGCCGGCT 


GAGCCTGGGC 


CTTCCTGCAA 


1440 


GTCACCCTCC 


GACCAGCTCC 


CTCCTCAGCA 


GCCCCTGGAA 


GCTGAGAAAG 


CCAGCCCCCA 


1500 


CCCCTCGCCT 


GGACCCTGCC 


GCCCGCCCCA 


CGGCACCCAG 


GCACCAGGGC 


TGGCCAAAGC 


1560 


CAGGTCCCTC . 


AGCGTCCAGC 


ACATGTCCAG 


CCCTGGCGAA 


GCGGTGGAAG 


GCGGCGTCCG 


1620 


GTGCCGGTCT 


CGGAGCATCC 


AGTACTGTGT 


TCCCCGAGAC 


GATGCCGCCC 


CCGAGGCAGA 


1680 


TGGCCAGGCT 


GCCGGCGCCC 


TGGCCTCTCG 


CAACAGCCAC 


TCGGCTGAGC 


TCCCACCCCC 


1740 


AGACCAGCCC 


TCTCCGTGCA 


AATGCACATG 


CAAGAAGGAG 


CCCTCTTCGG 


TGTCCCCGAG 


1800 


CGCCACGGTC 


AAGACCCGCA 


GCACCAAAGC 


GCCGCCGCCG 


CACCTGCCCC 


TGTCGCCGGC 


1860 


CCTGAGCCGG 


GCGGTGGAGG 


GCGTCCAGTA 


CATTGCAGAC 


CACCTGAAGG 


CCGAAGACAC 


1920 


AGACTTCTCG 


GTGAAGGAGG 


ACTGGAAGTA 


CGTGGCCATG 


GTCATCGACC 


GCATCTTCCT 


1980 


CTGGATGTTC 


ATCATCGTCT 


GCCTGCTGGG 


GACGGTGGGC 


CTCTTCCTGC 


CGCCCTGGCT 


2040 


GGCTGGCATG 


ATCTAGGAAG 


GGACCGGGAG 


CCTGCGTGGC 


CTGGGGCTGC 


CGYGCACGGG 


2100 
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GCCAGCATCC ATGCGGCCGG CCTGGGGCCG GGCTGGCTTC TCCCTGGACT CTGTGGGGCC 2160 

ACACGTTTGC CAAATTTTCC TTCCTGTTCT GTGTCTGCTG TAAGACGGCC TTGGACGGCG 2220 

ACACGGCCTC TGGGGAGACC GAGTGTGGAG CTGCTTCCAG TTGGACTCTS GCCTCAGNAG 2280 

GCAGCGGCTT GGAGCAGAGG TGGCGGTCGC CGCCTYCTAC CTGCAGGACT CGGGCTAAGT 2340 

CCAGCTCTCC CCCTGCGCAG CCC 23 63 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 627 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Glu Leu Gly Gly Pro Gly Ala Pro Arg Leu Leu Pro Pro Leu Leu 
15 10 15 

Leu Leu Leu Gly Thr Gly Leu Leu Arg Ala Ser Ser His Val Glu Thr 
20 25 30 

Arg Ala His Ala Glu Glu Arg Leu Leu Lys Lys Leu Phe Ser Gly Tyr 
35 40 45 

Asn Lys Trp Ser Arg Pro Val Ala Asn He Ser Asp Val Val Leu Val 
50 55 60 

Arg Phe Gly Leu Ser He Ala Gin Leu He Asp Val Asp Glu Lys Asn 
65 70 75 80 

Gin Met Met Thr Thr Asn Val Trp Val Lys Gin Glu Trp His Asp Tyr 
85 90 95 

Lys Leu Arg Trp Asp Pro Ala Asp Tyr Glu Asn Val Thr Ser He Arg 
100 105 HO 

He Pro Ser Glu Leu lie Trp Arg Pro Asp He Ala Leu Tyr Asn Asn 
115 120 125 

Ala Asp Gly Asp Phe Ala Ala Thr His Leu Thr Lys Ala His Leu Phe 
130 135 140 

His Asp Gly Arg Val Gin Arg Thr Pro Pro Ala He Tyr Lys Ser Ser 
145 150 155 160 

Cys Ser He Asp Val Thr Phe Phe Pro Phe Asp Gin Gin Asn Cys Thr 
165 170 175 

Met Lys Phe Gly Ser Trp Thr Tyr Asp Lys Ala Lys lie Asp Leu Val 
180 185 190 

Asn Met His Ser Arg Val Asp Gin Leu Asp Phe Trp Glu Ser Gly Glu 
195 200 205 



10 



Trp Leu He Ser Asp Ala Val Gly Thr Tyr Asn Thr Arg Lys Tyr Glu 
210 215 220 

Cvs Cys Ala Glu He Tyr Pro Asp He Thr Tyr Ala Tyr Ala He Arg 
225 230 235 240 

Arg Leu Pro Leu Phe Tyr Thr He Asn Leu He He Pro Trp Leu Leu 
245 250 255 

He Ser Cys Leu Thr Ala Leu Val Phe Tyr Leu Pro Ser Glu Cys Gly 
260 265 270 

Glu Lvs He Thr Leu Cys He Ser Val Leu Leu Ser Leu Thr Val Phe 
275 280 285 

Leu Leu Leu He Thr Glu He He Pro Ser Thr Ser Leu Val He Pro 
290 295 300 

Leu He Gly Glu Tyr Leu Leu Phe Thr Met He Phe Val Thr Leu Ser 
305 310 315 320 

He Ala He Thr Val Phe Val Leu Asn Val His His Arg Ser Pro Arg 

-joc ^**0 335 

— - 

Thr His Thr Met Pro Thr Trp Val Arg Arg Val Phe Leu Asp He Val 
340 345 350 

Pro Arg Leu Leu Leu Met Lys Arg Pro Ser Val Val Lys Asp Asn Cys 
355 360 365 

Arg Arg Leu He Glu Ser Met His Lys Met Ala Ser Ala Pro Arg Phe 
370 375 380 

Trp Pro Glu Pro Glu Gly Glu Pro Pro Ala Thr Ser Gly Thr Gin Ser 
385 390 395 400 

Leu His Pro Pro Ser Pro Ser Phe Cys Val Pro Leu Asp Val Pro Ala 
405 410 415 

Glu Pro Gly Pro Ser Cys Lys Ser Pro Ser Asp Gin Leu Pro Pro Gin 
420 425 430 

Gin Pro Leu Glu Ala Glu Lys Ala Ser Pro His Pro Ser Pro Gly Pro 
435 440 445 

Cys Arg Pro Pro His Gly Thr Gin Ala Pro Gly Leu Ala Lys Ala Arg 
450 455 460 

Ser Leu Ser Val Gin His Met Ser Ser Pro Gly Glu Ala Val Glu Gly 
465 470 475 480 

Glv Val Arg Cys Arg Ser Arg Ser He Gin Tyr Cys Val Pro Arg Asp 
485 490 495 

Asp Ala Ala Pro Glu Ala Asp Gly Gin Ala Ala Gly Ala Leu Ala Ser 
500 505 510 

Arg Asn Ser His Ser Ala Glu Leu Pro Pro Pro Asp Gin Pro Ser Pro 
. 515 520 525 

Cys Lys Cys Thr Cys Lys Lys Glu Pro Ser Ser Val Ser Pro Ser Ala 
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530 535 540 

Thr Val Lys Thr Arg Ser Thr Lys Ala Pro Pro Pro His Leu Pro Leu 
545 550 555 560 

Ser Pro Ala Leu Ser Arg Ala Val Glu Gly Val Gin Tyr lie Ala Asp 
565 570 575 

His Leu Lys Ala Glu Asp Thr Asp Phe Ser Val Lys Glu Asp Trp Lys 
580 585 590 

Tyr Val Ala Met Val lie Asp Arg lie Phe Leu Trp Met Phe lie lie 
595 600 605 

Val Cys Leu Leu Gly Thr Val Gly Leu Phe Leu Pro Pro Trp Leu Ala 
610 615 620 

Gly Met He 
625 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 155.. 1561 

(D) OTHER INFORMATION: /product^ "ALPHA- 5 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CCCGGCGGGA GCTGTGGCGC GGAGCGGCCC CGCTGCTGCG TCTGCCCTCG TTTTGTCTCA 60 

CGACTCACAC TCAGTGCTGC ATTCCCCAAG AGTTCGCGTT CCCCGCGCGG CGGTCGAGAG 120 

GCGGCTGCCC GCGGTCCCGC GCGGGCGCGG GGCG ATG GCG GCG CGG GGG TCA 172 

Met Ala Ala Arg Gly Ser 
1 5 

GGG CCC CGC GCG CTC CGC CTG CTG CTC TTG GTC CAG CTG GTC GCG GGG 220 
Gly Pro Arg Ala Leu Arg Leu Leu Leu Leu Val Gin Leu Val Ala Gly 
10 15 20 

CGC TGC GGT CTA GCG GGC GCG GCG GGC GGC GCG CAG AGA GGA TTA TCT 268 
Arg Cys Gly Leu Ala Gly Ala Ala Gly Gly Ala Gin Arg Gly Leu Ser 
25 30 35 

GAA CCT TCT TCT ATT GCA AAA CAT GAA GAT AGT TTG CTT AAG GAT TTA 316 
Glu Pro Ser Ser He Ala Lys His Glu Asp Ser Leu Leu Lys Asp Leu 
40 45 50 

TTT CAA GAC TAC GAA AGA TGG GTT CGT CCT GTG GAA CAC CTG AAT GAC 364 
Phe Gin Asp Tyr Glu Arg Trp Val Arg Pro Val Glu His Leu Asn Asp 



12 



SD9951 



55 60 65 70 

AAA ATA AAA ATA AAA TTT GGA CTT GCA ATA TCT CAA TTG GTG GAT GTG 412 
Lys lie Lys lie Lys Phe Gly Leu Ala lie Ser Gin Leu Val Asp Val 
75 80 85 

GAT GAG AAA AAT CAG TTA ATG ACA ACA AAC GTC TGG TTG AAA CAG GAA 460 
Asp Glu Lys Asn Gin Leu Met Thr Thr Asn Val Trp Leu Lys Gin Glu 
90 95 100 

TGG ATA GAT GTA AAA TTA AGA TGG AAC CCT GAT GAG TAT GGT GGA ATA 508 
Trp lie Asp Val Lys Leu Arg Trp Asn Pro Asp Asp Tyr Gly Gly lie 
105 110 115 

AAA GTT ATA CGT GTT CCT TCA GAC TCT GTC TGG ACA CCA GAC ATC GTT 556 
Lys Val lie Arg Val Pro Ser Asp Ser Val Trp Thr Pro Asp lie Val 
120 125 130 

TTG TTT GAT AAT GCA GAT GGA CGT TTT GAA GGG ACC AGT ACG AAA ACA 604 
Leu Phe Asp Asn Ala Asp Gly Arg Phe Glu Gly Thr Ser Thr Lys Thr 
135 140 145 150 

GTC ATC AGG TAC AAT GGC ACT GTC ACC TGG ACT CCA CCG GCA AAC TAC 652 
Val lie Arg Tyr Asn Gly Thr Val Thr Trp Thr Pro Pro Ala Asn Tyr 
155 160 165 

AAA AGT TCC TGT ACC ATA GAT GTC ACG TTT TTC CCA TTT GAC CTT CAG 700 
Lys Ser Ser Cys Thr lie Asp Val Thr Phe Phe Pro Phe Asp Leu Gin 
170 175 180 

AAC TGT TCC ATG AAA TTT GGT TCT TGG ACT TAT GAT GGA TCA CAG GTT 748 
Asn Cys Ser Met Lys Phe Gly Ser Trp Thr Tyr Asp Gly Ser Gin Val 
185 190 195 

GAT ATA ATT CTA GAG GAC CAA GAT GTA GAC AAG AGA GAT TTT TTT GAT 796 
Asp lie lie Leu Glu Asp Gin Asp Val Asp Lys Arg Asp Phe Phe Asp 
200 205 210 

AAT GGA GAA TGG GAG ATT GTG AGT GCA ACA GGG AGC AAA GGA AAC AGA 844 
Asn Gly Glu Trp Glu lie Val Ser Ala Thr Gly Ser Lys Gly Asn Arg 
215 220 225 230 

ACC GAC AGC TGT TGC TGG TAT CCG TAT GTC ACT TAC TCA TTT GTA ATC 892 
Thr Asp Ser Cys Cys Trp Tyr Pro Tyr Val Thr Tyr Ser Phe Val lie 
235 240 245 

AAG CGC CTG CCT CTC TTT TAT ACC TTG TTC CTT ATA ATA CCC TGT ATT 940 
Lys Arg Leu Pro Leu Phe Tyr Thr Leu Phe Leu lie lie Pro Cys lie 
250 255 260 

GGG CTC TCA TTT TTA ACT GTA CTT GTC TTC TAT CTT CCT TCA AAT GAA 988 
Gly Leu Ser Phe Leu Thr Val Leu Val Phe Tyr Leu Pro Ser Asn Glu 
265 270 275 

GGT GAA AAG ATT TGT CTC TGC ACT TCA GTA CTT GTG TCT TTG ACT GTC 1036 
Gly Glu Lys lie Cys Leu Cys Thr Ser Val Leu Val Ser Leu Thr Val 
280 285 290 

TTC CTT CTG GTT ATT GAA GAG ATC ATA CCA TCA TCT TCA AAA GTC ATA 1084 
Phe Leu Leu Val lie Glu Glu lie lie Pro Ser Ser Ser Lys Val He 
295 300 305 310 
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CCT CTA ATT GGA GAG TAT CTG GTA TTT ACC ATG ATT TTT GTG ACA CTG 1132 
Pro Leu lie Gly Glu Tyr Leu Val Phe Thr Met lie Phe Val Thr Leu 
315 320 325 

TCA ATT ATG GTA ACC GTC TTC GCT ATC AAC ATT CAT CAT CGT TCT TCC 1180 
Ser lie Met Val Thr Val Phe Ala lie Asn lie His His Arg Ser Ser 
330 335 340 

TCA ACA CAT AAT GCC ATG GCG CCT TTG GTC CGC AAG ATA TTT CTT CAC 1228 
Ser Thr His Asn Ala Met Ala Pro Leu Val Arg Lys lie Phe Leu His 
345 350 355 

ACG CTT CCC AAA CTG CTT TGC ATG AGA AGT CAT GTA GAC AGG TAC TTC 1276 
Thr Leu Pro Lys Leu Leu Cys Met Arg Ser His Val Asp Arg Tyr Phe 
360 365 370 

ACT CAG AAA GAG GAA ACT GAG AGT GGT AGT GGA CCA AAA TCT TCT AGA 1324 
Thr Gin Lys Glu Glu Thr Glu Ser Gly Ser Gly Pro Lys Ser Ser Arg 
375 380 385 390 

AAC ACA TTG GAA GCT GCG CTC AAT TCT ATT CGC TAC ATT ACA AGA CAC 1372 
Asn Thr Leu Glu Ala Ala Leu Asn Ser lie Arg Tyr lie Thr Arg His 
395 400 405 

ATC ATG AAG GAA AAT GAT GTC CGT GAG GTT GTT GAA GAT TGG AAA TTC 1420 
lie Met Lys Glu Asn Asp Val Arg Glu Val Val Glu Asp Trp Lys Phe 
410 415 420 

ATA GCC CAG GTT CTT GAT CGG ATG TTT CTG TGG ACT TTT CTT TTC GTT 1468 
lie Ala Gin Val Leu Asp Arg Met Phe Leu Trp Thr Phe Leu Phe Val 
425 430 435 

TCA ATT GTT GGA TCT CTT GGG CTT TTT GTT CCT GTT ATT TAT AAA TGG 1516 
Ser lie Val Gly Ser Leu Gly Leu Phe Val Pro Val lie Tyr Lys Trp 
440 445 450 

GCA AAT ATA TTA ATA CCA GTT CAT ATT GGA AAT GCA AAT AAG TGAAGCCTCC 1568 
Ala Asn lie Leu lie Pro Val His lie Gly Asn Ala Asn Lys 



455 


460 


465 








CAAGGGACTG 


AAGTATACAT 


TTAGTTAACA 


CACATATATC 


TGATGGCACC 


TATAAAATTA 


1628 


TGAAAATGTA 


AGTTATGTGT 


TAAATTTAGT 


GCAAGCTTTA 


ACAGACTAAG 


TTGCTAACCT 


1688 


CAATTTATGT 


TAACAGATGA 


TCCATTTGAA 


CAGTTGGCTG 


TATGACTGAA 


GTAATAACTG 


1748 


ATGAGATACA 


TTTGATCTTG 


TAAAAATAGC 


AAAATATTAT 


CTGAACTGGA 


CTAGTGAAAA 


1808 


ATCTAGTATT 


TGTATCCTGG 










1828 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
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325 330 335 

lie His His Arg Ser Ser Ser Thr His Asn Ala Met Ala Pro Leu Val 
340 345 350 

Arg Lys lie Phe Leu His Thr Leu Pro Lys Leu Leu Cys Met Arg Ser 
355 360 365 

His Val Asp Arg Tyr Phe Thr Gin Lys Glu Glu Thr Glu Ser Gly Ser 
370 375 380 

Gly Pro Lys Ser Ser Arg Asn Thr Leu Glu Ala Ala Leu Asn Ser lie 
385 390 395 400 

Arg Tyr lie Thr Arg His lie Met Lys Glu Asn Asp Val Arg Glu Val 
405 410 415 

Val Glu Asp Trp Lys Phe lie Ala Gin Val Leu Asp Arg Met Phe Leu 
420 425 430 

Trp Thr Phe Leu Phe Val Ser lie Val Gly Ser Leu Gly Leu Phe Val 
435 440 445 

Pro Val lie Tyr Lys Trp Ala Asn He Leu He Pro Val His He Gly 
450 455 460 

Asn Ala Asn Lys 
465 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 143 .. 1627 

(D) OTHER INFORMATION: /product= "ALPHA- 6 SUBUNIT" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

CGGGTTTTGA TTTCTGAGAA GACACACACG GATTGCAGTG GGCTTCTGAT GATGTCAAGG 60 

TTGGATGCAT GTGGCTGACT GATAGCTCTT TGTTTTCCAC AATCCTTTGC CTAGGAAAAA 120 

GGAATCCAAG TGTGTTTTAA CO ATG CTG ACC AGC AAG GGG CAG GGA TTC CTT 172 

Met Leu Thr Ser Lys Gly Gin Gly Phe Leu 
15 10 

CAT GGG GGC TTG TGT CTC TGG CTG TGT GTG TTC AC A CCT TTC TTT AAA 220 
His Gly Gly Leu Cys Leu Trp Leu Cys Val Phe Thr Pro Phe Phe Lys 
15 20 25 

GGC TGT GTG GGC TGT GCA ACT GAG GAG AGG CTC TTC CAC AAA CTG TTT 268 
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Gly Cys Val Gly Cys Ala Thr Glu Glu Arg Leu Phe His Lys Leu Phe 
30 35 40 

TCT CAT TAC AAC CAG TTC ATC AGG CCT GTG GAA AAC GTT TCC GAC CCT 316 
Ser His Tyr Asn Gin Phe lie Arg Pro Val Glu Asn Val Ser Asp Pro 
45 50 55 

GTC ACG GTA CAC TTT GAA GTG GCC ATC ACC CAG CTG GCC AAC GTG GAT 364 
Val Thr Val His Phe Glu Val Ala lie Thr Gin Leu Ala Asn Val Asp 
60 65 70 

GAA GTA AAC CAG ATC ATG GAA ACC AAT TTG TGG CTG CGT CAC ATC TGG 412 
Glu Val Asn Gin lie Met Glu Thr Asn Leu Trp Leu Arg His lie Trp 
75 80 85 90 

AAT GAT TAT AAA TTG CGC TGG GAT CCA ATG GAA TAT GAT GGC ATT GAG 460 
Asn Asp Tyr Lys Leu Arg Trp Asp Pro Met Glu Tyr Asp Gly lie Glu 
95 100 105 

ACT CTT CGC GTT CCT GCA GAT AAG ATT TGG AAG CCC GAC ATT GTT CTC 508 
Thr Leu Arg Val Pro Ala- Asp Lys lie Trp Lys Pro Asp lie Val Leu 
110 115 120 

TAT AAC AAT GCT GTT GGT GAC TTC CAA GTA GAA GGC AAA ACA AAA GCT 556 
Tyr Asn Asn Ala Val Gly Asp Phe Gin Val Glu Gly Lys Thr Lys Ala 
125 130 135 

CTT CTT AAA TAC AAT GGC ATG ATA ACC TGG ACT CCA CCA GCT ATT TTT 604 
Leu Leu Lys Tyr Asn Gly Met lie Thr Trp Thr Pro Pro Ala lie Phe 
140 145 150 

AAG AGT TCC TGC CCT ATG GAT ATC ACC TTT TTC CCT TTT GAT CAT CAA 652 
Lys Ser Ser Cys Pro Met Asp lie Thr Phe Phe Pro Phe Asp His Gin 
155 160 165 170 

AAC TGT TCC CTA AAA TTT GGT TCC TGG ACG TAT GAC AAA GCT GAA ATT 700 
Asn Cys Ser Leu Lys Phe Gly Ser Trp Thr Tyr Asp Lys Ala Glu lie 
175 180 185 

GAT CTT CTA ATC ATT GGA TCA AAA GTG GAT ATG AAT GAT TTT TGG GAA 748 
Asp Leu Leu lie lie Gly Ser Lys Val Asp Met Asn Asp Phe Trp Glu 
190 195 200 

AAC AGT GAA TGG GAA ATC ATT GAT GCC TCT GGC TAC AAA CAT GAC ATC 796 
Asn Ser Glu Trp Glu lie lie Asp Ala Ser Gly Tyr Lys His Asp lie 
205 210 215 

AAA TAC AAC TGT TGT GAA GAG ATA TAC ACA GAT ATA ACC TAT TCT TTC 844 
Lys Tyr Asn Cys Cys Glu Glu lie Tyr Thr Asp lie Thr Tyr Ser Phe 
220 225 230 

TAC ATT AGA AGA TTG CCG ATG TTT TAC ACG ATT AAT CTG ATC ATC CCT 892 
Tyr lie Arg Arg Leu Pro Met Phe Tyr Thr lie Asn Leu lie lie Pro 
235 240 245 250 

TGT CTC TTT ATT TCA TTT CTA ACC GTG TTG GTC TTT TAC CTT CCT TCG 940 
Cys Leu Phe lie Ser Phe Leu Thr Val Leu Val Phe Tyr Leu Pro Ser 
255 260 265 

GAC TGT GGT GAA AAA GTG ACG CTT TGT ATT TCA GTC CTG CTT TCT CTG 988 
Asp Cys Gly Glu Lys Val Thr Leu Cys lie Ser Val Leu Leu Ser Leu 
270 275 280 
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ACT GTG TTT TTG CTG GTC ATC AC A GAA ACC ATC CCA TCC ACA TCT CTG 1036 
Thr Val Phe Leu Leu Val lie Thr Glu Thr lie Pro Ser Thr Ser Leu 
285 290 295 

GTG GTC CCA CTG GTG GGT GAG TAC CTG CTG TTC ACC ATG ATC TTT GTC 1084 
Val Val Pro Leu Val Gly Glu Tyr Leu Leu Phe Thr Met lie Phe Val 
300 305 310 

ACA CTG TCC ATC GTG GTG ACT GTG TTT GTG TTG AAC ATA CAC TAC CGC 1132 
Thr Leu Ser lie Val Val Thr Val Phe Val Leu Asn lie His Tyr Arg 
315 320 325 330 

ACC CCA ACC ACG CAC ACA ATG CCC AGG TGG GTG AAG ACA GTT TTC CTG 1180 
Thr Pro Thr Thr His Thr Met Pro Arg Trp Val Lys Thr Val Phe Leu 
335 340 345 

AAG CTG CTG CCC CAG GTC CTG CTG ATG AGG TGG CCT CTG GAC AAG ACA 1228 
Lys Leu Leu Pro Gin Val Leu Leu Met Arg Trp Pro Leu Asp Lys Thr 
350 355 360 

AGG GGC ACA GGC TCT GAT GCA GTG CCC AGA GGC CTT GCC AGG AGG CCT 1276 
Arg Gly Thr Gly Ser Asp Ala Val Pro Arg Gly Leu Ala Arg Arg Pro 
365 370 375 

GCC AAA GGC AAG CTT GCA AGC CAT GGG GAA CCC AGA CAT CTT AAA GAA 1324 
Ala Lys Gly Lys Leu Ala Ser His Gly Glu Pro Arg His Leu Lys Glu 
380 i3b 390 

TGC TTC CAT TGT CAC AAA VCA AAT GAG CTT GCC ACA AGC AAG AGA AGA 1372 
Cys Phe His Cys His Lys Ser Asn Glu Leu Ala Thr Ser Lys Arg Arg 
395 400 405 410 

TTA AGT CAT CAG CCA TTA CAG TGG GTG GTG GAA AAT TCG GAG CAC TCG 1420 
Leu Ser His Gin Pro Leu Gin Trp Val Val Glu Asn Ser Glu His Ser 
415 420 425 

CCT GAA GTT GAA GAT GTG ATT AAC AGT GTT CAG TTC ATA GCA GAA AAC 1468 
Pro Glu Val Glu Asp Val lie Asn Ser Val Gin Phe lie Ala Glu Asn 
430 435 440 

ATG AAG AGC CAC AAT GAA ACC AAG GAG GTA GAA GAT GAC TGG AAA TAC 1516 
Met Lys Ser His Asn Glu Thr Lys Glu Val Glu Asp Asp Trp Lys Tyr 
445 450 455 

GTG GCC ATG GTG GTG GAC AGA GTA TTT CTT TGG GTA TTT ATA ATT GTC 1564 
Val Ala Met Val Val Asp Arg Val Phe Leu Trp Val Phe lie lie Val 
460 465 470 

TGT GTA TTT GGA ACT GCA GGG CTA TTT CTA CAG CCA CTA CTT GGG AAC 1612 
Cys Val Phe Gly Thr Ala Gly Leu Phe Leu Gin Pro Leu Leu Gly Asn 
475 480 485 490 

ACA GGA AAA TCT TAAAATGTAT TTTCTTTTAT GTTCAGAAAT TTACAGACAC 1664 
Thr Gly Lys Ser 

495 

CATATTTGTT CTGCATTCCC TGCCACAAGG AAAGGAAAGC AAAGGCTTCC CACCCAAGTC 1724 
CCCCATCTGC TAAAACCCG 1743 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Leu Thr Ser Lys Gly Gin Gly Phe Leu His Gly Gly Leu Cys Leu 
15 10 15 

Trp Leu Cys Val Phe Thr Pro Phe Phe Lys Gly Cys Val Gly Cys Ala 
20 25 30 

Thr Glu Glu Arg Leu Phe His Lys Leu Phe Ser His Tyr Asn Gin Phe 
35 40 45 

He Arg Pro Val Glu Asn Val Ser Asp Pro Val Thr Val His Phe Glu 
50 55 60 

Val Ala He Thr Gin Leu Ala Asn Val Asp Glu Val Asn Gin He Met 
65 70 75 80 

Glu Thr Asn Leu Trp Leu Arg His He Trp Asn Asp Tyr Lys Leu Arg 
85 90 95 

Trp Asp Pro Met Glu Tyr Asp Gly He Glu Thr Leu Arg Val Pro Ala 
100 105 110 

Asp Lys He Trp Lys Pro Asp He Val Leu Tyr Asn Asn Ala Val Gly 
115 120 125 

Asp Phe Gin Val Glu Gly Lys Thr Lys Ala Leu Leu Lys Tyr Asn Gly 
130 135 140 

Met lie Thr Trp Thr Pro Pro Ala He Phe Lys Ser Ser Cys Pro Met 
145 150 155 160 

Asp He Thr Phe Phe Pro Phe Asp His Gin Asn Cys Ser Leu Lys Phe 
165 170 175 

Gly Ser Trp Thr Tyr Asp Lys Ala Glu He Asp Leu Leu He He Gly 
180 185 190 

Ser Lys Val Asp Met Asn Asp Phe Trp Glu Asn Ser Glu Trp Glu He 
195 200 205 

He Asp Ala Ser Gly Tyr Lys His Asp He Lys Tyr Asn Cys Cys Glu 
210 215 220 

Glu He Tyr Thr Asp He Thr Tyr Ser Phe Tyr He Arg Arg Leu Pro 
225 230 235 240 

Met Phe Tyr Thr He Asn Leu He He Pro Cys Leu Phe He Ser Phe 
245 250 255 

Leu Thr Val Leu Val Phe Tyr Leu Pro Ser Asp Cys Gly Glu Lys Val 
260 265 270 
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Thr Leu Cys lie Ser Val Leu Leu Ser Leu Thr Val Phe Leu Leu Val 
275 280 285 

lie Thr Glu Thr lie Pro Ser Thr Ser Leu Val Val Pro Leu Val Gly 
290 295 300 

Glu Tyr Leu Leu Phe Thr Met lie Phe Val Thr Leu Ser lie Val Val 
305 310 315 320 

Thr Val Phe Val Leu Asn lie His Tyr Arg Thr Pro Thr Thr His Thr 
325 330 335 

Met Pro Arg Trp Val Lys Thr Val Phe Leu Lys Leu Leu Pro Gin Val 
340 345 350 

Leu Leu Met Arg Trp Pro Leu Asp Lys Thr Arg Gly Thr Gly Ser Asp 
355 360 365 

Ala Val Pro Arg Gly Leu Ala Arg Arg Pro Ala Lys Gly Lys Leu Ala 
370 375 380 

Ser His Gly Glu Pro Arg His Leu Lys Glu Cys Phe His Cys His Lys 
385 390 395 400 

Ser Asn Glu Leu Ala Thr Ser Lys Arg Arg Leu Ser His Gin Pro Leu 
405 410 415 

Gin Trp Val Val Glu Asn Ser Glu His Ser Pro Glu Val Glu Asp Val 
420 425 430 

lie Asn Ser Val Gin Phe lie Ala Glu Asn Met Lys Ser His Asn Glu 
435 440 445 

Thr Lys Glu Val Glu Asp Asp Trp Lys Tyr Val Ala Met Val Val Asp 
450 455 460 

Arg Val Phe Leu Trp Val Phe lie lie Val Cys Val Phe Gly Thr Ala 
465 470 475 480 



Gly Leu Phe Leu Gin Pro Leu Leu Gly Asn Thr Gly Lys Ser 
485 490 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 73.. 1581 

(D) OTHER INFORMATION: /product= w ALPHA-7 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
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GGCCGCAGGC 


GCAGGCCCGG 


GCGACAGCCG 


AGACGTGGAG 


CGCGCCGGCT 


CGCTGCAGCT 


60 


CCGGGACTCA 


ACATGCGCTG 


CTCGCCGGGA 


GGCGTCTGGC 


TGGCGCTGGC 


CGCGTCGCTC 


120 


CTGCACGTGT 


CCCTGCAAGG 


CGAGTTCCAG 


AGGAAGCTTT 


ACAAGGAGCT 


GGTCAAGAAC 


180 


TACAATCCCT 


TGGAGAGGCC 


CGTGGCCAAT 


GACTCGCAAC 


CACTCACCGT 


CTACTTCTCC 


240 


CTGAGCCTCC 


TGCAGATCAT 


GGACGTGGAT 


GAGAAGAACC 


AAGTTTTAAC 


CACCAACATT 


300 


TGGCTGCAAA 


TGTCTTGGAC 


AGATCACTAT 


TTACAGTGGA 


ATGTGTCAGA 


ATATCCAGGG 


360 


GTGAAGACTG 


TTCGTTTCCC 


AGATGGCCAG 


ATTTGGAAAC 


CAGACATTCT 


TCTCTATAAC 


420 


AGTGCTGATG 


AGCGCTTTGA 


CGCCACATTC 


CACACTAACG 


TGTTGGTGAA 


TTCTTCTGGG 


480 


CATTGCCAGT 


ACCTGCCTCC 


AGGCATATTC 


AAGAGTTCCT 


GCTACATCGA 


TGTACGCTGG 


540 




n m/*i m ^ /7i/"v» 


CTGCAAACTG 


AAGTTTGGGT 


C CTGGTCTT A 


CGGAGGCTGG 


600 


TCCTTGGATC 


TGCAGATGCA 


GGAGGCAGAT 


ATCAGTGGCT 


ATATCCCCAA 


TGGAGAATGG 


660 


GACCTAGTGG 


GAATCCCCGG 


CAAGAGGAGT 


GAAAGGTTCT 


ATGAGTGCTG 


CAAAGAGCCC 


720 


TACCCCGATG 


TCACCTTCAC 


AGTGACCATG 


CGCCGCAGGA 


CGCTCTACTA 


TGGCCTCAAC 


780 


CTGCTGATCC 


CCTGTGTGCT 


CATCTCCGCC 


CTCGCCCTGC 


TGGTGTTCCT 


GCTTCCTGCA 


840 


GATTCCGGGG 


AGAAGATTTC 


CCTGGGGATA 


ACAGTCTTAC 


TCTCTCTTAC 


CGTCTTCATG 


900 


CTGCTCGTGG 


CTGAGATCAT 


GCCCGCAACA 


TCCGATTCGG 


TACCATTGAT 


AGCCCAGTAC 


960 


TTCGCCAGCA 


CCATGATCAT 


CGTGGGCCTC 


TCGGTGGTGG 


TGACGGTGAT 


CGTGCTGCAG 


1020 


TACCACCACC 


ACGACCCCGA 


CGGGGGCAAG 


ATGCCCAAGT 


GGACCAGAGT 


CATCCTTCTG 


1080 


AACTGGTGCG 


CGTGGTTCCT 


SCGAATGAAG 


AGGCCCGGGG 


AGGACAAGGT 


GCGCCCGGCC 


1140 


TGCCAGCACA 


AGCAGCGGCG 


CTGCAGCCTG 


GCCAGTGTGG 


AGATGAGCGC 


CGTGGCGCCG 


1200 


CCGCCCGCCA 


GCAACGGGAA 


CCTGCTGTAC 


ATCGGCTTCC 


GCGGCCTGGA 


CGGCGTGCAC 


1260 


TGTGTCCCGA 


CCCCCGACTC 


TGGGGTAGTG 


TGTGGCCGCA 


TGGCCTGCTC 


CCCCACGCAC 


1320 


GATGAGCACC 


TCCTGCACGG 


CGGGCAACCC 


CCCGAGGGGG 


ACCCGGACTT 


GGCCAAGATC 


1380 


CTGGAGGAGG 


TCCGCTACAT 


TGCCAATCGC 


TTCCGCTGCC 


AGGACGAAAG 


CGAGGCGGTC 


1440 


TGCAGCGAGT 


GGAAGTTCGC 


CGCCTGTGTG 


GTGGACCGCC 


TGTGCCTCAT 


GGCCTTCTCG 


1500 


GTCTTCACCA 


TCATCTGCAC 


CATCGGCATC 


CTGATGTCGG 


CTCCCAACTT 


CGTGGAGGCC 


1560 


GTGTCCAAAG 


ACTTTGCGTA 


ACCACGCCTG 


GTTCTGTACA 


TGTGGAAAAC 


TCACAGATGG 


1620 


GCAAGGCCTT 


TGGCTTGGCG 


AGATTTGGGG 


GTGCTAATCC 


AGGACAGCAT 


TACACGCCAC 


1680 


AACTCCAGTG 


TTCCCTTCTG 


GCTGTCAGTC 


GTGTTGCTTA 


CGGTTTCTTT 


GTTACTTTAG 


1740 


GTAGTAGAAT 


CTCAGCACTT 


TGTTTCATAT 


TCTCAGATGG 


GCTGATAGAT 


ATCCTTGGCA 


1800 


CATCCGTACC 


ATCGGTCAGC 


AGGGCCACTG 


AGTAGTCATT 


TTGCCCATTA 


GCCCACTGCC 


1860 
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TGGAAAGCCC TTCGGA 1876 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 502 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Arg Cys Ser Pro Gly Gly Val Trp Leu Ala Leu Ala Ala Ser Leu 
15 10 15 

Leu His Val Ser Leu Gin Gly Glu Phe Gin Arg Lys Leu Tyr Lys Glu 
20 25 30 

Leu Val Lys Asn Tyr Asn Pro Leu Glu Arg Pro Val Ala Asn Asp Ser 
35 40 45 

Gin Pro Leu Thr Val Tyr Phe Ser Leu Ser Leu Leu Gin lie Met Asp 
50 55 60 

Val Asp Glu Lys Asn Gin Val Leu Thr Thr Asn lie Trp Leu Gin Met 
65 70 75 80 

Ser. Trp Thr Asp His Tyr Leu Gin Trp Asn Val Ser Glu Tyr Pro Gly 
, :* 85 90 95 

Val- Lys Thr Val Arg Phe Pro Asp Gly Gin lie Trp Lys Pro Asp lie 

, ^.4> : ; 100 105 110 

Leu In yr Asn Ser Ala Asp Glu Arg Phe Asp Ala Thr Phe His Thr 
.15 120 125 

Aan Val Leu Val Asn Ser Ser Gly His Cys Gin Tyr Leu Pro Pro Gly 
.: 130 - 135 140 

lie Phe Lys Ser Ser Cys Tyr lie Asp Val Arg Trp Phe Pro Phe Asp 
145 150 155 160 

Val Gin His Cys Lys Leu Lys Phe Gly Ser Trp Ser Tyr Gly Gly Trp 
165 170 175 

Ser Leu Asp Leu Gin Met Gin Glu Ala Asp lie Ser Gly Tyr lie Pro 
180 185 190 

Asn Gly Glu Trp Asp Leu Val Gly lie Pro Gly Lys Arg Ser Glu Arg 
195 200 205 

Phe Tyr Glu Cys Cys Lys Glu Pro Tyr Pro Asp Val Thr Phe Thr Val 
210 215 220 

Thr Met Arg Arg Arg Thr Leu Tyr Tyr Gly Leu Asn Leu Leu lie Pro 
225 230 235 240 

Cys Val Leu lie Ser Ala Leu Ala Leu Leu Val Phe Leu Leu Pro Ala 



22 



245 250 255 

Asp Ser Gly Glu Lys lie Ser Leu Gly lie Thr Val Leu Leu Ser Leu 
260 265 270 

Thr Val Phe Met Leu Leu Val Ala Glu lie Met Pro Ala Thr Ser Asp 
275 280 285 

Ser Val Pro Leu lie Ala Gin Tyr Phe Ala Ser Thr Met He He Val 
290 295 300 

Gly Leu Ser Val Val Val Thr Val He Val Leu Gin Tyr His His His 
305 310 315 320 

Asp Pro Asp Gly Gly Lys Met Pro Lys Trp Thr Arg Val He Leu Leu 
325 330 335 

Asn Trp Cys Ala Trp Phe Leu Arg Met Lys Arg Pro Gly Glu Asp Lys 
340 345 350 

Val Arg Pro Ala Cys Gin His Lys Gin Arg Arg Cys Ser Leu Ala Ser 
355 360 365 

Val Glu Met Ser Ala Val Ala Pro Pro Pro Ala Ser Asn Gly Asn Leu 
370 375 380 

Leu Tyr lie Gly Phe Arg Gly Leu Asp Gly Val His Cys Val Pro Thr 
385 390 395 400 

Pro Asp Ser Gly Val Val Cys Gly Arg Met Ala Cys Ser Pro Thr His 
405 410 415 

Asp Glu His Leu Leu His Gly Gly Gin Pro Pro Glu Gly Asp Pro Asp 
420 425 430 

Leu Ala Lys He Leu Glu Glu Val Arg Tyr lie Ala Asn Arg Phe Arg 
435 440 445 

Cys Gin Asp Glu Ser Glu Ala Val Cys Ser Glu Trp Lys Phe Ala Ala 
450 455 460 

Cys Val Val Asp Arg Leu Cys Leu Met Ala Phe Ser Val Phe Thr lie 
465 470 475 480 

lie Cys Thr lie Gly lie Leu Met Ser Ala Pro Asn Phe Val Glu Ala 
485 490 495 

Val Ser Lys Asp Phe Ala 
500 



INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 
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(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 265.. 1773 

(D) OTHER INFORMATION: /product= "BETA -2 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CTCCTCCCCC TCACCGTCCC AATTGTATTC CCTGGAAGAG CAGCCGGAAA AGCCTCCGCC 60 

TGCTCATACC AGGATAGGCA AGAAGCTGGT TTCTCCTCGC AGCCGGCTCC CTGAGGCCCA 120 

GGAACCACCG CGGCGGCCGG CACCACCTGG ACCCAGCTCC AGGCGGGCGC GGCTTCAGCA 180 

CCACGGACAG CGCCCCACCC GCGGCCCTCC CCCCGGCGGC GCGCTCCAGC CGGTGTAGGC 240 

GAGGCAGCGA GCTATGCCCG CGGC ATG GCC CGG CGC TGC GGC CCC GTG GCG 291 

Met Ala Arg Arg Cys Gly Pro Val Ala 
1 5 

CTG CTC CTT GGC TTC GGC CTC CTC CGG CTG TGC TCA GGG GTG TGG GGT 339 
Leu Leu Leu Gly Phe Gly Leu Leu Arg Leu Cys Ser Gly Val Trp Gly 
10 15 20 25 

ACG GAT ACA GAG GAG CGG CTG GTG GAG CAT CTC CTG GAT CCT TCC CGC 387 
Thr Asp Thr Glu Glu Arg Leu Val Glu His Leu Leu Asp Pro Ser Arg 
30 35 40 

TAC AAC AAG CTT ATC CGC CCA GCC ACC AAT GGC TCT GAG CTG GTG ACA 435 
Tyr Asn Lys Leu lie Arg Pro Ala Thr Asn Gly Ser Glu Leu Val Thr 
45 50 55 

GTA CAG CTT ATG GTG TCA CTG GCC CAG CTC ATC AGT GTG CAT GAG CGG 483 
Val Gin Leu Met Val Ser Leu Ala Gin Leu lie Ser Val His Glu Arg 
60 65 70 

GAG CAG ATC ATG ACC ACC AAT GTC TGG CTG ACC CAG GAG TGG GAA GAT 531 
Glu Gin lie Met Thr Thr Asn Val Trp Leu Thr Gin Glu Trp Glu Asp 
75 80 85 

TAT CGC CTC ACC TGG AAG CCT GAA GAG TTT GAC AAC ATG AAG AAA GTT 579 
Tyr Arg Leu Thr Trp Lys Pro Glu Glu Phe Asp Asn Met Lys Lys Val 
90 95 100 105 

CGG CTC CCT TCC AAA CAC ATC TGG CTC CCA GAT GTG GTC CTG TAC AAC 627 
Arg Leu Pro Ser Lys His lie Trp Leu Pro Asp Val Val Leu Tyr Asn 
110 115 120 

AAT GCT GAC GGC ATG TAC GAG GTG TCC TTC TAT TCC AAT GCC GTG GTC 675 
Asn Ala Asp Gly Met Tyr Glu Val Ser Phe Tyr Ser Asn Ala Val Val 
125 130 135 

TCC TAT GAT GGC AGC ATC TTC TGG CTG CCG CCT GCC ATC TAC AAG AGC 723 
Ser Tyr Asp Gly Ser lie Phe Trp Leu Pro Pro Ala lie Tyr Lys Ser 
140 145 150 

GCA TGC AAG ATT GAA GTA AAG CAC TTC CCA TTT GAC CAG CAG AAC TGC 771 
Ala Cys Lys lie Glu Val Lys His Phe Pro Phe Asp Gin Gin Asn Cys 
155 160 165 

ACC ATG AAG TTC CGT TCG TGG ACC TAC GAC CGC ACA GAG ATC GAC TTG 819 
Thr Met Lys Phe Arg Ser Trp Thr Tyr Asp Arg Thr Glu lie Asp Leu 
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170 175 180 185 

GTG CTG AAG AGT GAG GTG GCC AGC CTG GAC GAC TTC ACA CCT AGT GGT 867 
Val Leu Lys Ser Glu Val Ala Ser Leu Asp Asp Phe Thr Pro Ser Gly 
190 195 200 

GAG TGG GAC ATC GTG GCG CTG CCG GGC CGG CGC AAC GAG AAC CCC GAC 915 
Glu Trp Asp lie Val Ala Leu Pro Gly Arg Arg Asn Glu Asn Pro Asp 
205 210 215 

GAC TCT ACG TAC GTG GAC ATC ACG TAT GAC TTC ATC ATT CGC CGC AAG 963 
Asp Ser Thr Tyr Val Asp lie Thr Tyr Asp Phe lie lie Arg Arg Lys 
220 225 230 

CCG CTC TTC TAC ACC ATC AAC CTC ATC ATC CCC TGT GTG CTC ATC ACC 1011 
Pro Leu Phe Tyr Thr lie Asn Leu lie lie Pro Cys Val Leu lie Thr 
235 240 245 

TCG CTA GCC ATC CTT GTC TTC TAC CTG CCA TCC GAC TGT GGC GAG AAG 1059 
Ser Leu Ala lie Leu Val Phe Tyr Leu Pro Ser Asp Cys Gly Glu Lys 
250 255 260 265 

ATG ACG TTG TGC ATC TCA GTG CTG CTG GCG CTC ACG GTC TTC CTG CTG 1107 
Met Thr Leu Cys He Ser Val Leu Leu Ala Leu Thr Val Phe Leu Leu 
270 275 280 

CTC ATC TCC AAG ATC GTG CCT CCC ACC TCC CTC GAC GTG CCG CTC GTC 1155 
Leu He Ser Lys He Val Pro Pro Thr Ser Leu Asp Val Pro Leu Val 
285 290 295 

GGC AAG TAC CTC ATG TTC ACC ATG GTG CTT GTC ACC TTC TCC ATC GTC 1203 
Gly Lys Tyr Leu Met Phe Thr Met Val Leu Val Thr Phe Ser He Val 
300 305 310 

ACC AGC GTG TGC GTG CTC AAC GTG CAC CAC CGC TCG CCC ACC ACG CAC 1251 
Thr Ser Val Cys Val Leu Asn Val His His Arg Ser Pro Thr Thr His 
315 320 325 

ACC ATG GCG CCC TGG GTG AAG GTC GTC TTC CTG GAG AAG CTG CCC GCG 1299 
Thr Met Ala Pro Trp Val Lys Val Val Phe Leu Glu Lys Leu Pro Ala 
330 335 340 345 

CTG CTC TTC ATG CAG CAG CCA CGC CAT CAT TGC GCC CGT CAG CGC CTG 1347 
Leu Leu Phe Met Gin Gin Pro Arg His His Cys Ala Arg Gin Arg Leu 
350 355 360 

CGC CTG CGG CGA CGC CAG CGT GAG CGC GAG GGC GCT GGA GCC CTC TTC 1395 
Arg Leu Arg Arg Arg Gin Arg Glu Arg Glu Gly Ala Gly Ala Leu Phe 
365 370 375 

TTC CGC GAA GCC CCA GGG GCC GAC TCC TGC ACG TGC TTC GTC AAC CGC 1443 
Phe Arg Glu Ala Pro Gly Ala Asp Ser Cys Thr Cys Phe Val Asn Arg 
380 385 390 

GCG TCG GTG CAG GGG TTG GCC GGG GCC TTC GGG GCT GAG CCT GCA CCA 1491 
Ala Ser Val Gin Gly Leu Ala Gly Ala Phe Gly Ala Glu Pro Ala Pro 
395 400 405 

GTG GCG GGC CCC GGG CGC TCA GGG GAG CCG TGT GGC TGT GGC CTC CGG 1539 
Val Ala Gly Pro Gly Arg Ser Gly Glu Pro Cys Gly Cys Gly Leu Arg 
410 415 420 425 
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GAG GCG GTG GAC GGC GTG CGC TTC ATC GCA GAC CAC ATG CGG AGC GAG 1587 

Glu Ala Val Asp Gly Val Arg Phe He Ala Asp His Met Arg Ser Glu 

430 435 440 

GAC GAT GAC CAG AGC GTG AGT GAG GAC TGG AAG TAC GTC GCC ATG GTG 1635 

Asp Asp Asp Gin Ser Val Ser Glu Asp Trp Lys Tyr Val Ala Met Val 

445 450 455 

ATC GAC CGC CTC TTC CTC TGG ATC TTT GTC TTT GTC TGT GTC TTT GGC 1683 

He Asp Arg Leu Phe Leu Trp He Phe Val Phe Val Cys Val Phe Gly 

460 465 470 



ACC ATC GGC ATG TTC CTG CAG CCT CTC TTC CAG AAC TAC ACC ACC ACC 1731 
Thr He Gly Met Phe Leu Gin Pro Leu Phe Gin Asn Tyr Thr Thr Thr 
475 480 485 



ACC TTC CTC CAC TCA GAC CAC TCA GCC CCC AGC 
Thr Phe Leu His Ser Asp His Ser Ala Pro Ser 
490 495 500 


TCC AAG TGAGGCCCTT 
Ser Lys 


1780 


CCTCATCTCC 


ATGCTCTTTC 


ACCCTGCCAC 


CCTCTGCTGC 


ACAGTAGTGT 


TGGGTGGAGG 


1840 


ATGGACGAGT 


GAGCTACCAG 


GAAGAGGGGC 


GCTGCCCCCA 


CAGATCCATC 


CTTTTGCTTC 


1900 


ATCTGGAGTC 


CCTCCTCCCC 


CACGCCTCCA 


TCCACACACA 


GCAGCTCCAA 


CCTGGAGGCT 


1960 


GGACCAACTG 


CTTTGTTTTG 


GCTGCTCTCC 


ATCTCTTGTA 


CCAGCCCAGG 


CAATAGTGTT 


2020 


GAGGAGGGGA 


GCAAGGCTGC 


TAAGTGGAAG 


ACAGAGATGG 


CAGAGCCATC 


CACCCTGAGG 


2080 


AGTGACGGGC 


AAGGGGCCAG 


GAAGGGGACA 


GGATTGTCTG 


CTGCCTCCAA 


GTCATGGGAG 


2140 


AAGAGGGGTA 


TAGGACAAGG 


GGTGGAAGGG 


CAGGAGCTCA 


CACCGCACCG 


GGCTGGCCTG 


2200 


ACACAATGGT 


AGCTCTGAAG 


GGAGGGGAAG 


AGAGAGGCCT 


GGGTGTGACC 


TGACACCTGC 


2260 


CGCTGCTTGA 


GTGGACAGCA 


GCTGGACTGG 


GTGGGCCCCA 


CAGTGGTCAG 


CGATTCCTGC 


2320 


CAAGTAGGGT 


TTAGCCGGGC 


CCCATGGTCA 


CAGACCCCTG 


GGGGAGGCTT 


CCAGCTCAGT 


2380 


CCCACAGCCC 


CTTGCTTCTA 


AGGGATCCAG 


AGACCTGCTC 


CAGATCCTCT 


TTCCCCACTG 


2440 



AAGAATTC 2448 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Ala Arg Arg Cys Gly Pro Val Ala Leu Leu Leu Gly Phe Gly Leu 
15 10 15 

Leu Arg Leu Cys Ser Gly Val Trp Gly Thr Asp Thr Glu Glu Arg Leu 
20 25 30 
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Val Glu His Leu Leu Asp Pro Ser Arg Tyr Asn Lys Leu He Arg Pro 
35 40 45 

Ala Thr Asn Gly Ser Glu Leu Val Thr Val Gin Leu Met Val Ser Leu 
50 55 60 

Ala Gin Leu He Ser Val His Glu Arg Glu Gin He Met Thr Thr Asn 
65 70 75 80 

Val Trp Leu Thr Gin Glu Trp Glu Asp Tyr Arg Leu Thr Trp Lys Pro 
85 90 95 

Glu Glu Phe Asp Asn Met Lys Lys Val Arg Leu Pro Ser Lys His He 
100 105 HO 

Trp Leu Pro Asp Val Val Leu Tyr Asn Asn Ala Asp Gly Met Tyr Glu 
115 120 125 

Val Ser Phe Tyr Ser Asn Ala Val Val Ser Tyr Asp Gly Ser He Phe 
130 135 140 

Trp Leu Pro Pro Ala He Tyr Lys Ser Ala Cys Lys He Glu Val Lys 
145 150 155 160 

His Phe Pro Phe Asp Gin Gin Asn Cys Thr Met Lys Phe Arg Ser Trp 
165 170 175 

Thr Tyr Asp Arg Thr Glu He Asp Leu Val Leu Lys Ser Glu Val Ala 
180 185 190 

Ser Leu Asp Asp Phe Thr Pro Ser Gly Glu Trp Asp He Val Ala Leu 
195 200 205 

Pro Gly Arg Arg Asn Glu Asn Pro Asp Asp Ser Thr Tyr Val Asp He 
210 215 220 

Thr Tyr Asp Phe He He Arg Arg Lys Pro Leu Phe Tyr Thr He Asn 
225 230 235 240 

Leu He He Pro Cys Val Leu He Thr Ser Leu Ala He Leu Val Phe 
245 250 255 

Tyr Leu Pro Ser Asp Cys Gly Glu Lys Met Thr Leu Cys lie Ser Val 
260 265 270 

Leu Leu Ala Leu Thr Val Phe Leu Leu Leu He Ser Lys He Val Pro 
275 280 285 

Pro Thr Ser Leu Asp Val Pro Leu Val Gly Lys Tyr Leu Met Phe Thr 
290 295 300 

Met Val Leu Val Thr Phe Ser He Val Thr Ser Val Cys Val Leu Asn 
305 310 315 320 

Val His His Arg Ser Pro Thr Thr His Thr Met Ala Pro Trp Val Lys 
325 330 335 

Val Val Phe Leu Glu Lys Leu Pro Ala Leu Leu Phe Met Gin Gin Pro 
340 345 350 

Arg His His Cys Ala Arg Gin Arg Leu Arg Leu Arg Arg Arg Gin Arg 
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355 360 365 

Glu Arg Glu Gly Ala Gly Ala Leu Phe Phe Arg Glu Ala Pro Gly Ala 
370 375 380 

Asp Ser Cys Thr Cys Phe Val Asn Arg Ala Ser Val Gin Gly Leu Ala 
385 390 395 400 

Gly Ala Phe Gly Ala Glu Pro Ala Pro Val Ala Gly Pro Gly Arg Ser 
405 410 415 

Gly Glu Pro Cys Gly Cys Gly Leu Arg Glu Ala Val Asp Gly Val Arg 
420 425 430 

Phe lie Ala Asp His Met Arg Ser Glu Asp Asp Asp Gin Ser Val Ser 
435 440 445 

Glu Asp Trp Lys Tyr Val Ala Met Val lie Asp Arg Leu Phe Leu Trp 
450 455 460 

He Phe Val Phe Val Cys Val Phe Gly Thr He Gly Met Phe Leu Gin 
465 470 475 480 

Pro Leu Phe Gin Asn Tyr Thr Thr Thr Thr Phe Leu His Ser Asp His 
485 490 495 

Ser Ala Pro Ser Ser Lys 
500 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1927 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 98.. 1474 

(D) OTHER INFORMATION: /product= "BETA -3 SUBUNIT" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TCGGAACCCC TGTATTTTCT TTTCAAAACC CCCTTTTCCA GTGGAAATGC TCTGTTGTTA 60 

AAAAGGAAGA AACTGTCTTT CTGAAACTGA CATCACG ATG CTC CCA GAT TTT ATG 115 

Met Leu Pro Asp Phe Met 
1 5 

CTG GTT CTC ATC GTC CTT GGC ATC CCT TCC TCA GCC ACC ACA GGT TTC 163 
Leu Val Leu He Val Leu Gly He Pro Ser Ser Ala Thr Thr Gly Phe 
10 15 20 

AAC TCA ATC GCC GAA AAT GAA GAT GCC CTC CTC AGA CAT TTG TTC CAA 211 
Asn Ser He Ala Glu Asn Glu Asp Ala Leu Leu Arg His Leu Phe Gin 
25 30 35 
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GGT TAT CAG AAA TGG GTC CGC CCT GTA TTA CAT TCT AAT GAC ACC ATA 259 
Gly Tyr Gin Lys Trp Val Arg Pro Val Leu His Ser Asn Asp Thr lie 
40 45 50 

AAA GTA TAT TTT GGA TTG AAA ATA TCC CAG CTT GTA GAT GTG GAT GAA 307 
Lys Val Tyr Phe Gly Leu Lys lie Ser Gin Leu Val Asp Val Asp Glu 
55 60 65 70 

AAG AAT CAG CTG ATG AC A ACC AAT GTG TGG CTC AAA CAG GAA TGG AC A 355 
Lys Asn Gin Leu Met Thr Thr Asn Val Trp Leu Lys Gin Glu Trp Thr 
75 80 85 

GAC CAC AAG TTA CGC TGG AAT CCT GAT GAT TAT GGT GGG ATC CAT TCC 403 
Asp His Lys Leu Arg Trp Asn Pro Asp Asp Tyr Gly Gly lie His Ser 
90 95 100 

ATT AAA GTT CCA TCA GAA TCT CTG TGG CTT CCT GAC ATA GTT CTC TTT 451 
lie Lys Val Pro Ser Glu Ser Leu Trp Leu Pro Asp lie Val Leu Phe 
105 110 115 

GAA AAT GCT GAC GGC CGC TTC GAA GGC TCC CTG ATG ACC AAG GTC ATC 499 
Glu Asn Ala Asp Gly Arg Phe Glu Gly Ser Leu Met Thr Lys Val He 
120 125 130 

GTG AAA TCA AAC GGA ACT GTT GTC TGG ACC CCT CCC GCC AGC TAC AAA 547 
Val Lys Ser Asn Gly Thr Val Val Trp Thr Pro Pro Ala Ser Tyr Lys 
135 140 145 150 

AGC TCC TGC ACC ATG GAC GTC ACG TTT TTC CCG TTC GAC CGA CAG AAC 595 
Ser Ser Cys Thr Met Asp Val Thr Phe Phe Pro Phe Asp Arg Gin Asn 
155 160 165 

TGC TCC ATG AAG TTT GGA TCC TGG ACT TAT GAT GGC ACC ATG GTT GAC 643 
Cys Ser Met Lys Phe Gly Ser Trp Thr Tyr Asp Gly Thr Met Val Asp 
170 175 180 

CTC ATT TTG ATC AAT GAA AAT GTC GAC AGA AAA GAC TTC TTC GAT AAC 691 
Leu He Leu He Asn Glu Asn Val Asp Arg Lys Asp Phe Phe Asp Asn 
185 190 195 

GGA GAA TGG GAA ATA CTG AAT GCA AAG GGG ATG AAG GGG AAC AGA AGG 739 
Gly Glu Trp Glu He Leu Asn Ala Lys Gly Met Lys Gly Asn Arg Arg 
200 205 210 

GAC GGC GTG TAC TCC TAT CCC TTT ATC ACG TAT TCC TTC GTC CTG AGA 787 
Asp Gly Val Tyr Ser Tyr Pro Phe lie Thr Tyr Ser Phe Val Leu Arg 
215 220 225 230 

CGC CTG CCT TTA TTC TAT ACC CTC TTT CTC ATC ATC CCC TGC CTG GGG 835 
Arg Leu Pro Leu Phe Tyr Thr Leu Phe Leu He He Pro Cys Leu Gly 
235 240 245 

CTG TCT TTC CTA ACA GTT CTT GTG TTC TAT TTA CCT TCG GAT GAA GGA 883 
Leu Ser Phe Leu Thr Val Leu Val Phe Tyr Leu Pro Ser Asp Glu Gly 
250 255 260 

GAA AAA CTT TCA TTA TCC ACA TCG GTC TTG GTT TCT CTG ACA GTT TTC 931 
Glu Lys Leu Ser Leu Ser Thr Ser Val Leu Val Ser Leu Thr Val Phe 
265 270 275 

CTT TTA GTG ATT GAA GAA ATC ATC CCA TCG TCT TCC AAA GTC ATT CCT 979 
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Leu Leu Val lie Glu Glu lie lie Pro Ser Ser Ser Lys Val lie Pro 
280 285 290 

CTC ATT GGA GAG TAC CTG CTG TTC ATC ATG ATT TTT GTG ACC CTG TCC 1027 
Leu lie Gly Glu Tyr Leu Leu Phe lie Met lie Phe Val Thr Leu Ser 
295 300 305 310 

ATC ATT GTT ACC GTG TTT GTC ATT AAC GTT CAC CAC AGA TCT TCT TCC 1075 
lie lie Val Thr Val Phe Val lie Asn Val His His Arg Ser Ser Ser 
315 320 325 

ACG TAC CAC CCC ATG GCC CCC TGG GTT AAG AGG CTC TTT CTG CAG AAA 1123 
Thr Tyr His Pro Met Ala Pro Trp Val Lys Arg Leu Phe Leu Gin Lys 
330 335 340 

CTT CCA AAA TTA CTT TGC ATG AAA GAT CAT GTG GAT CGC TAC TCA TCC 1171 
Leu Pro Lys Leu Leu Cys Met Lys Asp His Val Asp Arg Tyr Ser Ser 
345 350 355 

CCA GAG AAA GAG GAG AGT CAA CCA GTA GTG AAA GGC AAA GTC CTC GAA 1219 
Pro Glu Lys Glu Glu Ser Gin Pro Val Val Lys Gly Lys Val Leu Glu 
360 365 370 

AAA AAG AAA CAG AAA CAG CTT AGT GAT GGA GAA AAA GTT CTA GTT GCT 1267 
Lys Lys Lys Gin Lys Gin Leu Ser Asp Gly Glu Lys Val Leu Val Ala 
375 380 385 390 

TTT TTG GAA AAA GCT GCT GAT TCC ATT AGA TAC ATT TCC AGA CAT GTG 1315 
Phe Leu Glu Lys Ala Ala Asp Ser lie Arg Tyr lie Ser Arg His Val 
395 400 405 

AAG AAA GAA CAT TTT ATC AGC CAG GTA GTA CAA GAC TGG AAA TTT GTA 1363 
Lys Lys Glu His Phe lie Ser Gin Val Val Gin Asp Trp Lys Phe Val 
410 415 420 

GCT CAA GTT CTT GAC CGA ATC TTC CTG TGG CTC TTT CTG ATA GTG TCA 1411 
Ala Gin Val Leu Asp Arg lie Phe Leu Trp Leu Phe Leu lie Val Ser 
425 430 435 

GCA AC A GGC TCG GTT CTG ATT TTT ACC CCT GCT TTG AAG ATG TGG CTA 1459 
Ala Thr Gly Ser Val Leu lie Phe Thr Pro Ala Leu Lys Met Trp Leu 
440 445 450 



CAT AGT TAC CAT TAGGAATTTC AAAAGACATA AGTACTAAAT TACACCTTAG 


1511 


His Ser Tyr His 












455 














ACCTGACATC 


TGGCTATCAC 


ACAGACAGAA 


TCCAAATGCA 


TGTGCTTGTT 


CTACGAACCC 


1571 


CGAATGCGTT 


GTCTTTGTGG 


AAATGGAACA 


TCTCCTCATG 


GGAGAAACTC 


TGGTAAATGT 


1631 


GCTCATTTGT 


GGTTGCCATG 


AGAGTGAGCT 


GCTTTTAAAG 


AAAGTGGAGC 


CTCCTCAGAC 


1691 


CCCTGCCTTG 


GCTTTCCCAO 


ACATTCAGGG 


AGGGATCATA 


GGTCCAGGCT 


TGAGCTCACA 


1751 


TGTGGCCAGA 


GTGCACAAAA 


AGCTGTTGCT 


ACTTGGTGGA 


GGAACACCTC 


CTAGAAGCAG 


1811 


CAGGCCTCGG 


TGGTGGGGGA 


GGGGGGATTC 


ACCTGGAATT 


AAGGAAGTCT 


CGGTGTCGAG 


1871 


CTATCTGTGT 


GGGCAGAGCC 


TGGATCTCCC 


ACCCTGCACT 


GGCCTCCTTG 


GTGCCG 


1927 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Leu Pro Asp Phe Met Leu Val Leu lie Val Leu Gly lie Pro Ser 
15 10 15 

Ser Ala Thr Thr Gly Phe Asn Ser lie Ala Glu Asn Glu Asp Ala Leu 
20 25 30 

Leu Arg His Leu Phe Gin Gly Tyr Gin Lys Trp Val Arg Pro Val Leu 
35 40 45 

His Ser Asn Asp Thr lie Lys Val Tyr Phe Gly Leu Lys lie Ser Gin 
50 55 60 

Leu Val Asp Val Asp Glu Lys Asn Gin Leu Met Thr Thr Asn Val Trp 
65 70 75 80 

Leu Lys Gin Glu Trp Thr Asp His Lys Leu Arg Trp Asn Pro Asp Asp 
85 90 95 

Tyr Gly Gly lie His Ser lie Lys Val Pro Ser Glu Ser Leu Trp Leu 
100 105 110 

Pro Asp lie Val Leu Phe Glu Asn Ala Asp Gly Arg Phe Glu Gly Ser 
115 120 125 

Leu Met Thr Lys Val lie Val Lys Ser Asn Gly Thr Val Val Trp Thr 
130 135 140 

Pro Pro Ala Ser Tyr Lys Ser Ser Cys Thr Met Asp Val Thr Phe Phe 
145 150 155 160 

Pro Phe Asp Arg Gin Asn Cys Ser Met Lys Phe Gly Ser Trp Thr Tyr 
165 170 175 

Asp Gly Thr Met Val Asp Leu lie Leu lie Asn Glu Asn Val Asp Arg 
180 185 190 

Lys Asp Phe Phe Asp Asn Gly Glu Trp Glu lie Leu Asn Ala Lys Gly 
195 200 205 

Met Lys Gly Asn Arg Arg Asp Gly Val Tyr Ser Tyr Pro Phe lie Thr 
210 215 220 

Tyr Ser Phe Val Leu Arg Arg Leu Pro Leu Phe Tyr Thr Leu Phe Leu 
225 230 235 240 

lie lie Pro Cys Leu Gly Leu Ser Phe Leu Thr Val Leu Val Phe Tyr 
245 250 255 

Leu Pro Ser Asp Glu Gly Glu Lys Leu Ser Leu Ser Thr Ser Val Leu 
260 265 270 
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Val Ser Leu Thr Val Phe Leu Leu Val He Glu Glu He He Pro Ser 
275 280 285 

Ser Ser Lys Val He Pro Leu He Gly Glu Tyr Leu Leu Phe He Met 
290 295 300 

He Phe Val Thr Leu Ser He He Val Thr Val Phe Val He Asn Val 
305 310 315 320 

His His Arg Ser Ser Ser Thr Tyr His Pro Met Ala Pro Trp Val Lys 
325 330 335 

Arg Leu Phe Leu Gin Lys Leu Pro Lys Leu Leu Cys Met Lys Asp His 
340 345 350 

Val Asp Arg Tyr Ser Ser Pro Glu Lys Glu Glu Ser Gin Pro Val Val 
355 360 365 

Lys Gly Lys Val Leu Glu Lys Lys Lys Gin Lys Gin Leu Ser Asp Gly 
370 375 380 

Glu Lys Val Leu Val Ala Phe Leu Glu Lys Ala Ala Asp Ser He Arg 
385 390 395 400 

Tyr He Ser Arg His Val Lys Lys Glu His Phe He Ser Gin Val Val 
405 410 415 

Gin Asp Trp Lys Ph,- Val Ala Gin Val Leu Asp Arg He Phe Leu Trp 
420 425 430 

Leu Phe Leu He Val Ser Ala Thr Gly Ser Val Leu He Phe Thr Pro 
435 J 440 445 

Ala Leu Lys Met Trp Leu His Ser Tyr His 
450 455 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 87.. 1583 

(D) OTHER INFORMATION: /product = "BETA- 4 SUBUNIT" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CCGGCGCTCA CTCGACCGCG CGGCTCACGG GTGCCCTGTG ACCCCACAGC GGAGCTCGCG 60 

GCGGCTGCCA CCCGGCCCCG CCGGCCATGA GGCGCGCGCC TTCCCTGGTC CTTTTCTTCC 120 

TGGTCGCCCT TTGCGGGCGC GGGAACTGCC GCGTGGCCAA TGCGGAGGAA AAGCTGATGG 180 
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ACGACCTTCT 


GAACAAAACC 


CGTTACAATA 


ACCTGATCCG 


CCCAGCCACC 


AGCTCCTCAC 


240 


AGCTCATCTC 


CATCAAGCTG 


CAGCTCTCCC 


TGGCCCAGCT 


TATCAGCGTG 


AATGAGCGAG 


300 


AGCAGATCAT 


GACCACCAAT 


GTCTGGCTGA 


AACAGGAATG 


GACTGATTAC 


CGCCTGACCT 


360 


GGAACAGCTC 


CCGCTACGAG 


GGTGTGAACA 


TCCTGAGGAT 


CCCTGCAAAG 


CGCATCTGGT 


420 


TGCCTGACAT 


CGTGCTTTAC 


AACAACGCCG 


ACGGGACCTA 


TGAGGTGTCT 


GTCTACACCA 


480 


ACTTGATAGT 


CCGGTCCAAC 


GGCAGCGTCC 


TGTGGCTGCC 


CCCTGCCATC 


TACAAGAGCG 


540 


CCTGCAAGAT 


TGAGGTGAAG 


TACTTTCCCT 


TCGACCAGCA 


GAACTGCACC 


CTCAAGTTCC 


600 


GCTCCTGGAC 


CTATGACCAC 


ACGGAGATAG 


ACATGGTCCT 


CATGACGCCC 


ACAGCCAGCA 


660 


TGGATGACTT 


TACTCCCAGT 


GGTGAGTGGG 


ACATAGTGGC 


CCTCCCAGGG 


AGAAGGACAG 


720 


TGAACCCACA 


AGACCCCAGC 


TACGTGGACG 


TGACTTACGA 


CTTCATCATC 


AAGCGCAAGC 


780 


CTCTGTTCTA 


CACCATCAAC 


CTCATCATCC 


CCTGCGTGCT 


CACCACCTTG 


CTGGCCATCC 


840 


TCGTCTTCTA 


CCTGCCATCC 


GACTGCGGCG 


AGAAGATGAC 


ACTGTGCATC 


TCAGTGCTGC 


900 


TGGCACTGAC 


ATTCTTCCTG 


CTGCTCATCT 


CCAAGATCGT 


GCCACCCACC 


TCCCTCGATG 


960 


TGCCTCTCAT 


CGGCAAGTAC 


CTCATGTTCA 


CCATGGTGCT 


GGTCACCTTC 


TCCATCGTCA 


1020 


CCAGCGTCTG 


TGTGCTCAAT 


GTGCACCACC 


GCTCGCCCAG 


CACCCACACC 


ATGGCACCCT 


1080 


GGGTCAAGCG 


CTGCTTCCTG 


CACAAGCTGC 


CTACCTTCCT 


CTTCATGAAG 


CGCCCTGGCC 


1140 


CCGACAGCAG 


CCCGGCCAGA 


GCCTTCCCGC 


CCAGCAAGTC 


ATGCGTGACC 


AAGCCCGAGG 


1200 


CCACCGCCAC 


CTCCACCAGC 


CCCTCCAACT 


TCTATGGGAA 


CTCCATGTAC 


TTTGTGAACC 


1260 


CCGCCTCTGC 


AGCTTCCAAG 


TCTCCAGCCG 


GCTCTACCCC 


GGTGGCTATC 


CCCAGGGATT 


1320 


TCTGGCTGCG 


GTCCTCTGGG 


AGGTTCCGAC 


AGGATGTGCA 


GGAGGCATTA 


GAAGGTGTCA 


1380 


GCTTCATCGC 


CCAGCACATG 


AAGAATGACG 


ATGAAGACCA 


GAGTGTCGTT 


GAGGACTGGA 


1440 


AGTACGTGGC 


TATGGTGGTG 


GACCGGCTGT 


TCCTGTGGGT 


GTTCATGTTT 


GTGTGCGTCC 


1500 


TGGGCACTGT 


GGGGCTCTTC 


CTGCCGCCCC 


TCTTCCAGAC 


CCATGCAGCT 


TCTGAGGGGC 


1560 


CCTACGCTGC 


CCAGCGTGAC 


TGAGGGCCCC 


CTGGGTTGTG 


GGGTGAGAGG 


ATGTGAGTGG 


1620 


CCGGGTGGGC 


ACTTTGCTGC 


TTCTTTCTGG 


GTTGTGGCCG 


ATGAGGCCCT 


AAGTAAATAT 


1680 


GTGAGCATTG 


GCCATCAACC 


CCATCAAACC 


AGCCACAGCC 


GTGGAACAGG 


CAAGGATGGG 


1740 


GGCCTGGCCT 


GTCCTCTCTG 


AATGCCTTGG 


AGGGATCCCA 


GGAAGCCCCA 


GTAGGAGGGA 


1800 


GCTTCAGACA 


GTTCAATTCT 


GGCCTGTCTT 


CCTTCCCTGC 


ACCGGGCAAT 


GGGGATAAAG 


1860 


ATGACTTCGT 


AGCAGCACCT 


ACTATGCTTC 


AGGCATGGTG 


CCGGCCTGCC 


TCTCC 


1915 



(2) INFORMATION FOR SEQ ID NO: 18: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

<ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Arg Arg Ala Pro Ser Leu Val Leu Phe Phe Leu Val Ala Leu Cys 
15 10 15 

Gly Arg Gly Asn Cys Arg Val Ala Asn Ala Glu Glu Lys Leu Met Asp 
20 25 30 

Asp Leu Leu Asn Lys Thr Arg Tyr Asn Asn Leu lie Arg Pro Ala Thr 
35 40 45 

Ser Ser Ser Gin Leu lie Ser lie Lys Leu Gin Leu Ser Leu Ala Gin 
50 55 60 

Leu lie Ser Val Asn Glu Arg Glu Gin lie Met Thr Thr Asn Val Trp 
65 70 75 80 

Leu Lys Gin Glu Trp Thr Asp Tyr Arg Leu Thr Trp Asn Ser .Ser Arg 
85 90 95 

Tyr Glu Gly Val Asn lie Leu Arg lie Pro Ala Lys Arg lie Trp Leu 
100 105 110 

Pro Asp lie Val Leu Tyr Asn Asn Ala Asp Gly Thr Tyr Glu Val Ser 
115 120 125 

Val Tyr Thr Asn Leu lie Val Arg Ser Asn Gly Ser Val Leu Trp Leu 
130 135 140 

Pro Pro Ala lie Tyr Lys Ser Ala Cys Lys lie Glu Val Lys Tyr Phe 
145 150 155 160 

Pro Phe Asp Gin Gin Asn Cys Thr Leu Lys Phe Arg Ser Trp Thr Tyr 
165 170 175 

Asp His Thr Glu lie Asp Met Val Leu Met Thr Pro Thr Ala Ser Met 
180 185 190 

Asp Asp Phe Thr Pro Ser Gly Glu Trp Asp lie Val Ala Leu Pro Gly 
195 200 205 

Arg Arg Thr Val Asn Pro Gin Asp Pro Ser Tyr Val Asp Val Thr Tyr 
210 215 220 

Asp Phe lie lie Lys Arg Lys Pro Leu Phe Tvr Thr lie Asn Leu lie 
225 230 235 240 

lie Pro Cys Val Leu Thr Thr Leu Leu Ala lie Leu Val Phe Tyr Leu 
245 250 255 

Pro Ser Asp Cys Gly Glu Lys Met Thr Leu Cys lie Ser Val Leu Leu 
260 265 270 

Ala Leu Thr Phe Phe Leu Leu Leu lie Ser Lys lie Val Pro Pro Thr 
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275 280 285 

Ser Leu Asp Val Pro Leu lie Gly Lys Tyr Leu Met Phe Thr Met Val 
290 295 300 

Leu Val Thr Phe Ser He Val Thr Ser Val Cys Val Leu Asn Val His 
305 310 315 320 

His Arg Ser Pro Ser Thr His Thr Met Ala Pro Trp Val Lys Arg Cys 
325 330 335 

Phe Leu His Lys Leu Pro Thr Phe Leu Phe Met Lys Arg Pro Gly Pro 
340 345 350 

Asp Ser Ser Pro Ala Arg Ala Phe Pro Pro Ser Lys Ser Cys Val Thr 
355 360 365 

Lys Pro Glu Ala Thr Ala Thr Ser Thr Ser Pro Ser Asn Phe Tyr Gly 
370 375 380 

Asn Ser Met Tyr Phe Val Asn Pro Ala Ser Ala Ala Ser Lys Ser Pro 
385 390 395 400 

Ala Gly Ser Thr Pro Val Ala He Pro Arg Asp Phe Trp Leu Arg Ser 
405 410 415 

Ser Gly Arg Phe Arg Gin Asp Val Gin Glu Ala Leu Glu Gly Val Ser 
420 425 430 

Phe He Ala Gin His Met Lys Asn Asp Asp Glu Asp Gin Ser Val Val 
435 440 445 

Glu Asp Trp Lys Tyr Val Ala Met Val Val Asp Arg Leu Phe Leu Trp 
450 455 460 

Val Phe Met Phe Val Cys Val Leu Gly Thr Val Gly Leu Phe Leu Pro 
465 470 475 480 

Pro Leu Phe Gin Thr His Ala Ala Ser Glu Gly Pro Tyr Ala Ala Gin 
485 490 495 

Arg Asp 
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